Junwei Bao

GVPO: Group Variance Policy Optimization for Large Language Model Post-Training

Apr 28, 2025
Via arXiv

Energy-Based Preference Model Offers Better Offline Alignment than the Bradley-Terry Preference Model

Dec 18, 2024
Via arXiv

Preference-Oriented Supervised Fine-Tuning: Favoring Target Model Over Aligned Large Language Models

Dec 17, 2024
Via arXiv

BoRA: Bi-dimensional Weight-Decomposed Low-Rank Adaptation

Dec 09, 2024
Via arXiv

Interactive-T2S: Multi-Turn Interactions for Text-to-SQL with Large Language Models

Aug 09, 2024
Via arXiv

Interactive-KBQA: Multi-Turn Interactions for Knowledge Base Question Answering with Large Language Models

Feb 23, 2024
Via arXiv

HopPG: Self-Iterative Program Generation for Multi-Hop Question Answering over Heterogeneous Knowledge

Sep 10, 2023
Via arXiv

AUGUST: an Automatic Generation Understudy for Synthesizing Conversational Recommendation Datasets

Jun 16, 2023
Via arXiv

SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation

Nov 27, 2022
Via arXiv

MoNET: Tackle State Momentum via Noise-Enhanced Training for Dialogue State Tracking

Nov 11, 2022
Via arXiv